Deposit 

Cosmids 21A4-2-1, 21A4-4-3-1, and 21A4-P5-1 were deposited with the
American Type Culture Collection on July 8, 1996, and bear the accession numbers
ATCC No. 97649, 97650, and 97651. Plasmid pKExNPR1 was deposited on July 31,
1996, and bears the accession number ATCC No. 97671. Applicants acknowledge their
responsibility to replace these plasmids should they lose viability before the end of the term
of a patent issued hereon, and their responsibility to notify the American Type Culture
Collection of the issuance of such a patent, at which time the deposits will be made
available to the public. Prior to that time the deposits will be made available to the
Commissioner of Patents under the terms of 37 CFR § 1.14 and 35 USC § 112. These
deposits are available as required by foreign patent laws in countries wherein counterparts
of the subject application, or its progeny, are filed. It should be understood that availability
of a deposit does not constitute a license to practice the subject invention.

All publications and patent applications mentioned in this specification are 
herein incorporated by reference to the same extent as if each independent publication or 
patent application was specifically and individually indicated to be incorporated by 
reference.

(2) INFORMATION FOR SEQ ID NO: 1:



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7548 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:



AAGCTTGTGA TGCAAGTCAT GGGATATTGC TTTGTGTTAA GTATACAAAA CCATCACGTG 60

GATACATAGT CTTCAAACCA ACCACTAAAC AGTATCAGGT CATACCAAAG CCAGAAGTGA 120

AGGGTTGGGA TATGTCATTG GGTTTAGCGG TAATCGGATT GAACCCTTTC CGGTATAAAA 180

TACAAAGGCT TTCGCAGTCT CGGCGTATGT GTATGTCTCG GGGTATCTAC CATTTGAATC 240

ACAGAACTTT TATGTGCGAA GTTTTCGATT CTGATTCGTT TACCTGGAAG AGATTAGAAA 300

TTTGCGTCTA CCAAAAACAG ACAGATTAAT TTTTTCCAAC CCGATACAAG TTTCGGGGTT 360

CTTGCATTGG ATATCACGGA ACAACAATGT GATCCGGTTT TGTCTCAAAA CCGAAACTTG 420

GTCCTTCTTC CATACTCCGA ACTCTGATGT TTTCTCAGGA TTAGTCAGAT ACGAAGGGAA 480

GCTAGGTGCT ATTCGTCAGT GGACAAACAA AGATCAAGAA GATGTTCACG AGTTATGGGT 540

TTTAAAGAGC AGTTTTGAAA AGTCGTGGGT TAAAGTGAAA GATATTAAAA GCATTGGAGT 600

AGATTTGATT ACGTGGACTC CAAGCAACGA CGTTGTATTG TTTCGTAGTA GTGATCGTGG 660 

TTGCCTCTAC AACATAAACG CAGAGAAGTT GAATTTAGTT TATGCAAAAA AAGAGGGATC 720

TGATTGTTCT TTCGTTTGTT TTCCGTTTTG TTCTGATTAC GAGAGGGTTG ATCTGAACGG 780 

AAGAAGCAAC GGGCCGACAC TTTAAAAAAA AAATAAAAAA AATGGGCCGA CAAATGCAAA 840

CGTAGTTGAC AAGGATCTCA AGTCTCAAGT CTCAATTGGC TCGCTCATTG TGGGGCATAA 900 

ATATATCTAG TGATGTTTAA TTGTTTTTTA TAAGGTAAAA AGGAATATTG AATTTTGTTT 960 

CTTAGGTTTA TGTAATAATA CCAAACATTG TTTTATGAAT ATTTAATCTG ATTTTTTGGC 1020 

TAGTTATTTT ATTATATCAA GGGTTCCTGT TTATAGTTGA AAACAGTTAC TGTATAGAAA 1080

ATAGTGTCCC AATTTTCTCT CTTAAATAAT ATATTAGTTA ATAAAAGATA TTTTAATATA 1140

TTAGATATAC AATAATATCT AAAGCAACAC ATATTTAGAC ACAACACGTA ATATCTTACT 1200

ATTGTTTACA TATATTTATA GCTTACCAAT ATAACCCGTA TCTATGTTTT ATAAGCTTTT 1260 

ATACAATATA TGTACGGTAT GCTGTCCACG TATATATATT CTCCAAAAAA AACGCATGGT 1320 

ACACAAAATT TATTAAATAT TTGGCAATTG GGTGTTTATC TAAAGTTTAT CACAATATTT 1380

ATCAACTATA ATAGATGGTA GAAGATAAAA AAATTATATC AGATTGATTC AATTAAATTT 1440

TATAATATAT CATTTTAAAA AATTAATTAA AAGAAAACTA TTTCATAAAA TTGTTCAAAA 1500 

GATAATTAGT AAAATTAATT AAATATGTGA TGCTATTGAA TTATAGAGAG TTATTGTAAA 1560 

TTTACTTAAA ATCATACAAA TCTTATCCTA ATTTAACTTA TCATTTAAGA AATACAAAAG 1620

TAAAAAACGC GGAAAGCAAT AATTTATTTA CCTTATTATA ACTCCTATAT AAAGTACTCT 1680 

GTTTATTCAA CATAATCTTA CGTTGTTGTA TTCATAGGCA TCTTTAACCT ATCTTTTCAT 1740

TTTCTGATCT CGATCGTTTT CGATCCAACA AAATGAGTCT ACCGGTGAGG AACCAAGAGG 1800 

TGATTATGCA GATTCCTTCT TCTTCTCAGT TTCCAGCAAC ATCGAGTCCG GAAAACACCA 1860

ATCAAGTGAA GGATGAGCCA AATTTGTTTA GACGTGTTAT GAATTTGCTT TTACGTCGTA 1920

GTTATTGAAA AAGCTGATTT ATCGCATGAT TCAGAACGAG AAGTTGAAGG CAAATAACTA 1980

AAGAAGTCTT TTATATGTAT ACAATAATTG TTTTTAAATC AAATCCTAAT TAAAAAAATA 2040

TATTCATTAT GACTTTCATG TTTTTAATGT AATTTATTCC TATATCTATA ATGATTTTTG 2100

TTGTGAAGAG CGTTTTCATT TGCTATAGAA CAAGGAGAAT AGTTCCAGGA AATATTCGAC 2160 

TTGATTTAAT TATAGTGTAA ACATGCTGAA CACTGAAAAT TACTTTTTCA ATAAACGAAA 2220 

AATATAATAT ACATTACAAA ACTTATGTGA ATAAAGCATG AGACTTAATA TACGTTCCCT 2280 

TTATCATTTT ACTTCAAAGA AAATAAACAG AAATGTAACT TTCACATGTA AATCTAATTC 2340

TTAAATTTAA AAAATAATAT TTATATATTT ATATGAAAAT AACGAACCGG ATGAAAAATA 2400

AATTTTATAT ATTTATATCA TCTCCAAATC TAGTTTGGTT CAGGGGCTTA CCGAACCGGA 2460

TTGAACTTCT CATATACAAA AATTAGCAAC ACAAAATGTC TCCGGTATAA ATACTAACAT 2520

TTATAACCCG AACCGGTTTA GCTTCCTGTT ATATCTTTTT AAAAAAGATC TCTGACAAAG 2580

ATTCCTTTCC TGGAAATTTA CCGGTTTTGG TGAAATGTAA ACCGTGGGAC GAGGATGCTT 2640

CTTCATATCT CACCACCACT CTCGTTGACT GGACTTGGCT CTGCTCGTCA ATGGTTATCT 2700

TCGATCTTAA ACCAAATCCA GTTGATAAGG TCTCTTCGTT GATTAGCAGA GATCTCTTTA 2760

ATTTGTGAAT TTCAATTCAT CGGAACCTGT TGATGGACAC CACCATTGAT GGATTCGCCG 2820

ATTCTTATGA AATCAGCAGC ACTAGTTTCG TCGCTACCGA TAACACCGAC TCCTCTATTG 2880

TTTATCTGGC CGCCGAACAA GTACTCACCG GACCTGATGT ATCTGCTCTG CAATTGCTCT 2940

CCAACAGCTT CGAATCCGTC TTTGACTCGC CGGATGATTT CTACAGCGAC GCTAAGCTTG 3000

TTCTCTCCGA CGGCCGGGAA GTTTCTTTCC ACCGGTGCGT TTTGTCAGCG AGAAGCTCTT 3060

TCTTCAAGAG CGCTTTAGCC GCCGCTAAGA AGGAGAAAGA CTCCAACAAC ACCGCCGCCG 3120 

TGAAGCTCGA GCTTAAGGAG ATTGCCAAGG ATTACGAAGT CGGTTTCGAT TCGGTTGTGA 3180 

CTGTTTTGGC TTATGTTTAC AGCAGCAGAG TGAGACCGCC GCCTAAAGGA GTTTCTGAAT 3240

GCGCAGACGA GAATTGCTGC CACGTGGCTT GCCGGCCGGC GGTGGATTTC ATGTTGGAGG 3300

TTCTCTATTT GGCTTTCATC TTCAAGATCC CTGAATTAAT TACTCTCTAT CAGGTAAAAC 3360

ACCATCTGCA TTAAGCTATG GTTACACATT CATGAATATG TTCTTACTTG AGTACTTGTA 3420 

TTTGTATTTC AGAGGCACTT ATTGGACGTT GTAGACAAAG TTGTTATAGA GGACACATTG 3480

GTTATACTCA AGCTTGCTAA TATATGTGGT AAAGCTTGTA TGAAGCTATT GGATAGATGT 3540

AAAGAGATTA TTGTCAAGTC TAATGTAGAT ATGGTTAGTC TTGAAAAGTC ATTGCCGGAA 3600

GAGCTTGTTA AAGAGATAAT TGATAGACGT AAAGAGCTTG GTTTGGAGGT ACCTAAAGTA 3660

AAGAAACATG TCTCGAATGT ACATAAGGCA CTTGACTCGG ATGATATTGA GTTAGTCAAG 3720

TTGCTTTTGA AAGAGGATCA CACCAATCTA GATGATGCGT GTGCTCTTCA TTTCGCTGTT 3780

GCATATTGCA ATGTGAAGAC CGCAACAGAT CTTTTAAAAC TTGATCTTGC CGATGTCAAC 3840

CATAGGAATC CGAGGGGATA TACGGTGCTT CATGTTGCTG CGATGCGGAA GGAGCCACAA 3900

TTGATACTAT CTCTATTGGA AAAAGGTGCA AGTGCATCAG AAGCAACTTT GGAAGGTAGA 3960

ACCGCACTCA TGATCGCAAA ACAAGCCACT ATGGCGGTTG AATGTAATAA TATCCCGGAG 4020

CAATGCAAGC ATTCTCTCAA AGGCCGACTA TGTGTAGAAA TACTAGAGCA AGAAGACAAA 4080

CGAGAACAAA TTCCTAGAGA TGTTCCTCCC TCTTTTGCAG TGGCGGCCGA TGAATTGAAG 4140

ATGACGCTGC TCGATCTTGA AAATAGAGGT ATCTATCAAG TCTTATTTCT TATATGTTTG 4200

AATTAAATTT ATGTCCTCTC TATTAGGAAA CTGAGTGAAC TAATGATAAC TATTCTTTGT 4260 

GTCGTCCACT GTTTAGTTGC ACTTGCTCAA CGTCTTTTTC CAACGGAAGC ACAAGCTGCA 4320 

ATGGAGATCG CCGAAATGAA GGGAACATGT GAGTTCATAG TGACTAGCCT CGAGCCTGAC 4380

CGTCTCACTG GTACGAAGAG AACATCACCG GGTGTAAAGA TAGCACCTTT CAGAATCCTA 4440

GAAGAGCATC AAAGTAGACT AAAAGCGCTT TCTAAAACCG GTATGGATTC TCACCCACTT 4500

CATCGGACTC CTTATCACAA AAAACAAAAC TAAATGATCT TTAAACATGG TTTTGTTACT 4560

TGCTGTCTGA CCTTGTTTTT TTATCATCAG TGGAACTCGG GAAACGATTC TTCCCGCGCT 4620

GTTCGGCAGT GCTCGACCAG ATTATGAACT GTGAGGACTT GACTCAACTG GCTTGCGGAG 4680

AAGACGACAC TGCTGAAGAA ACGACTACAA AAGAAGCAAA GGTACATGGA AATACAAGAG 4740

ACACTAAAGA AGGCCTTTAG TGAGGACAAT TTGGAATTAG GAAATTCGTC CCTGACAGAT 4800

TCGACTTCTT CCACATCGAA ATCAACCGGT GGAAAGAGGT CTAACCGTAA ACTCTCTCAT 4860

CGTCGTCGGT GAGACTCTTG CCTCTTAGTG TAATTTTTGC TGTACCATAT AATTCTGTTT 4920

TCATGATGAC TGTAACTGTT TATGTCTATC GTTGGCGTCA TATAGTTTCG CTCTTCGTTT 4980

TGCATCCTGT GTATTATTGC TGCAGGTGTG CTTCAAACAA ATGTTGTAAC AATTTGAACC 5040

AATGGTATAC AGATTTGTAA TATATATTTA TGTACATCAA CAATAACCCA TGATGGTGTT 5100 

ACAGAGTTGC TAGAATCAAA GTGTGAAATA ATGTCAAATT GTTCATCTGT TGGATATTTT 5160

CCACCAAGAA CCAAAAGAAT ATTCAAGTTC CCTGAACTTC TGGCAACATT CATGTTATAT 5220 

GTATCTTCCT AATTCTTCCT TTAACCTTTT GTAACTCGAA TTACACAGCA AGTTAGTTTC 5280

AGGTCTAGAG ATAAGAGAAC ACTGAGTGGG CGTGTAAGGT GCATTCTCCT AGTCAGCTCC 5340

ATTGCATCCA ACATTTGTGA ATGACACAAG TTAACAATCC TTTGCACCAT TTCTGGGTGC 5400

ATACATGGAA ACTTCTTCGA TTGAAACTTC CCACATGTGC AGGTGCGTTC GCTGTCACTG 5460

ATAGACCAAG AGACTGAAAG CTTTCACAAA TTGCCCTCAA ATCTTCTGTT TCTATCGTCA 5520 

TGACTCCATA TCTCCGACCA CTGGTCATGA GCCAGAGCCC ACTGATTTTG AGGGAATTGG 5580

GCTAACCATT TCCGAGCTTC TGAGTCCTTC TTTTTGATGT CCTTTATGTA GGAATCAAAT 5640

TCTTCCTTCT GACTTGTGGA TCCAGCCTGC TTCACAAGGC TCACCAGGTT GTAGTCTCCA 5700

AAAATATCAT GGAATTGTAA GCAAAAACAA TCCAGACAGA ACCTGTGATA GACCCAAGGT 5760

TCTTGCCACA GTGATCCGGG TTCGTTAATA ACAGCAACTA TGTCCGGGTG AGGACTGGAG 5820

ACGAAGCAAA CGTCTTTCCT TTGTGTTACC TTCTCTCTGA TATTAGTGAG AAACCAACGC 5880

CAACTATCAG TGGACACTTC TTTGGTAAGC GGAAAGCAAG CGGGAAAAAC AATCATCAGC 5940

GTCGAGTCCT GAGGAAAATC ATCAATTTCA TAGGGGTACT TGCCGTTCAA GTCTTTTGAA 6000

TCCACTATGA TCAGAGGTCT ACAGTGTTGA AACCCTTCAA TGGACTGTGG AAACGCCCAA 6060

AACGCGCCAC CGAAGGATGC AAATTCAGGA TTAGGGAAAA GCTCATATTG CAGTCCACAA 6120

GTAGCCCATT AGATGAGTGA AATGCAGCCA ATTAGTTTAG GCAATACTCT GAAACTCTGA 6180

TCTTTGATTA CTTCCTGTTC TGCTGCCCGC AGCTTTGAAG TTTTAAGCAT GTCACCAAAC 6240

TTTTCAACTC TGCTGTTAGA GTGGGTTGTA CCCTGATCAG ACACTCAATC TCTTCTGCTG 6300

CAAATTACAA GTTGAAGTTT TCCGGCTTAA TAGAACAACA AGTATGTGGA CCAACTACAC 6360 

TTAGTTATCT TAACAAGTCC ATGTTCTTCT ATTCAATCTG CCCGACGCGA CCAATTGCAT 6420 

TTCCATCTGA TGCATTTAAA CGTATACTCG TCCTTCTCAA TCTCTTGTAC TACACACTTT 6480

TGCTGCCCTC TAATGGAACA CCAGTCCACC GCCTTCTTCA GCTCATCCCT ATCTTTAAAA 6540

CACAACCCTA CACGCAATTC ATGATCATCA ATCCACAAAC TAGACAAAGT ACACTGTTTT 6600

GAAGCACTCG AATCAACAAC ACCTTTACTT AATAAGCACG CATACGGTAA TACCTCTAAG 6660 

CCTGGCACAT TCAAACCTTG TGTGCATCAT CTGAACCCGA GTTTTTATCC GTTATTTCTC 6720

CATCCCCACC TCCACGAGTG CTACCATTTC CGAAGTCAGA ATTTTCCTCG TCTTCAATCC 6780 

ACCCGTTACT GTTACCCACT CCCTGAACCT CTAAACCATT ATCTCTCTCT ACTTTCACAG 6840

ATGCATGTGA CACATAATCA GTAGCTTCTT GGGGTTGTTG CGTCCTCTGT GTATTCGAGG 6900 

AACTAGCGGG ATATTCTATT ACGGATGAAC AAGCAGCATG ATCAGTAACA TTATCAGATG 6960 

TCGATTTCAC TTCCAAATAC AACTCCACAT TTCTTATAGA AGGATGATAA CTTGGAACTT 7020 

CAAGCATAGT CTCCAAACTA GTGTCGTTCA CTACATGAAG AAGTAGATAG ATAAAGAGAT 7080 

CCGGTGAAAC AACTACAGGA TACTTACCAA AATATATTGA ACACTGATTT CTGCAGCTGC 7140

AATCCAAAAA TTGGATAAAG ACCATTCAAC AATGTACTTA ACGCAGTCTT TTGCCTAACC 7200

TTGACCGTTT TAGGAGTGGA TCCTTCATAG TAAACACCAT CAGGACCATA CTTGGTAGAA 7260

CCTTTCTCTC AAGGTTTCCA TCGCCATGAC CATAACAGTC CTGCAGTGAA TTCTAAGAAA 7320

AATGTAAAAA ATTTTGGCCT AAACTCATAA TTCTTAACAT ACGAAACCAT GGAGAACTCC 7380

ATGTCTAAAA AATAAAGGCT AAAGCTTTTT GGCGACAGAA GCAGATAAAT CCATTCAAAA 7440

CACATAAACT CTAAACAATA AACAGTGATA CTCAATACTA AGACTTGTAA AGGTCTACGT 7500 

AACTCAAAAC TGGAGAATTG TCAGATCGGG TGTGGCTAGT AGAAGCTT 7548
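
The running total printed at the right of each row of a nucleotide listing such as SEQ ID NO: 1 gives the position of the last base in that row, so a transcribed copy can be checked mechanically. The following is a minimal Python sketch of such a check. The function name `check_listing` and the row layout it assumes (whitespace-separated groups of bases followed by a cumulative count) are illustrative assumptions and are not part of the specification.

```python
def check_listing(rows):
    """Concatenate listing rows and verify each row's cumulative base count.

    Each non-empty row is assumed to hold whitespace-separated groups of
    bases followed by the running position of the last base in the row.
    Raises ValueError if a printed count disagrees with the bases seen.
    """
    sequence = []
    total = 0
    for row in rows:
        fields = row.split()
        if not fields:
            continue  # skip blank separator lines
        *groups, printed = fields
        bases = "".join(groups).upper()
        total += len(bases)
        if int(printed) != total:
            raise ValueError(f"row ending at {printed}: counted {total} bases")
        sequence.append(bases)
    return "".join(sequence)


# First two rows of SEQ ID NO: 1, used here only as sample input.
rows = [
    "AAGCTTGTGA TGCAAGTCAT GGGATATTGC TTTGTGTTAA GTATACAAAA CCATCACGTG 60",
    "",
    "GATACATAGT CTTCAAACCA ACCACTAAAC AGTATCAGGT CATACCAAAG CCAGAAGTGA 120",
]
fragment = check_listing(rows)
print(len(fragment))
```

Because the bases are joined before counting, a row whose groups were split by stray spaces still passes, whereas a dropped or duplicated base is reported at the first row whose count no longer agrees.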

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2104 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 93...1871

(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

TCGATCTTTA ACCAAATCCA GTTGATAAGG TCTCTTCGTT GATTAGCAGA GATCTCTTTA 60

ATTTGTGAAT TTCAATTCAT CGGAACCTGT TG ATG GAC ACC ACC ATT GAT GGA 113
Met Asp Thr Thr Ile Asp Gly
1 5

TTC GCC GAT TCT TAT GAA ATC AGC AGC ACT AGT TTC GTC GCT ACC GAT 161
Phe Ala Asp Ser Tyr Glu Ile Ser Ser Thr Ser Phe Val Ala Thr Asp
10 15 20

AAC ACC GAC TCC TCT ATT GTT TAT CTG GCC GCC GAA CAA GTA CTC ACC 209
Asn Thr Asp Ser Ser Ile Val Tyr Leu Ala Ala Glu Gln Val Leu Thr
25 30 35

GGA CCT GAT GTA TCT GCT CTG CAA TTG CTC TCC AAC AGC TTC GAA TCC 257
Gly Pro Asp Val Ser Ala Leu Gln Leu Leu Ser Asn Ser Phe Glu Ser
40 45 50 55

GTC TTT GAC TCG CCG GAT GAT TTC TAC AGC GAC GCT AAG CTT GTT CTC 305
Val Phe Asp Ser Pro Asp Asp Phe Tyr Ser Asp Ala Lys Leu Val Leu
60 65 70

TCC GAC GGC CGG GAA GTT TCT TTC CAC CGG TGC GTT TTG TCA GCG AGA 353
Ser Asp Gly Arg Glu Val Ser Phe His Arg Cys Val Leu Ser Ala Arg
75 80 85

AGC TCT TTC TTC AAG AGC GCT TTA GCC GCC GCT AAG AAG GAG AAA GAC 401
Ser Ser Phe Phe Lys Ser Ala Leu Ala Ala Ala Lys Lys Glu Lys Asp
90 95 100

TCC AAC AAC ACC GCC GCC GTG AAG CTC GAG CTT AAG GAG ATT GCC AAG 449
Ser Asn Asn Thr Ala Ala Val Lys Leu Glu Leu Lys Glu Ile Ala Lys
105 110 115

GAT TAC GAA GTC GGT TTC GAT TCG GTT GTG ACT GTT TTG GCT TAT GTT 497
Asp Tyr Glu Val Gly Phe Asp Ser Val Val Thr Val Leu Ala Tyr Val
120 125 130 135

TAC AGC AGC AGA GTG AGA CCG CCG CCT AAA GGA GTT TCT GAA TGC GCA 545
Tyr Ser Ser Arg Val Arg Pro Pro Pro Lys Gly Val Ser Glu Cys Ala
140 145 150

GAC GAG AAT TGC TGC CAC GTG GCT TGC CGG CCG GCG GTG GAT TTC ATG 593
Asp Glu Asn Cys Cys His Val Ala Cys Arg Pro Ala Val Asp Phe Met
155 160 165

TTG GAG GTT CTC TAT TTG GCT TTC ATC TTC AAG ATC CCT GAA TTA ATT 641
Leu Glu Val Leu Tyr Leu Ala Phe Ile Phe Lys Ile Pro Glu Leu Ile
170 175 180

ACT CTC TAT CAG AGG CAC TTA TTG GAC GTT GTA GAC AAA GTT GTT ATA 689
Thr Leu Tyr Gln Arg His Leu Leu Asp Val Val Asp Lys Val Val Ile
185 190 195

GAG GAC ACA TTG GTT ATA CTC AAG CTT GCT AAT ATA TGT GGT AAA GCT 737
Glu Asp Thr Leu Val Ile Leu Lys Leu Ala Asn Ile Cys Gly Lys Ala
200 205 210 215

TGT ATG AAG CTA TTG GAT AGA TGT AAA GAG ATT ATT GTC AAG TCT AAT 785
Cys Met Lys Leu Leu Asp Arg Cys Lys Glu Ile Ile Val Lys Ser Asn
220 225 230

GTA GAT ATG GTT AGT CTT GAA AAG TCA TTG CCG GAA GAG CTT GTT AAA 833
Val Asp Met Val Ser Leu Glu Lys Ser Leu Pro Glu Glu Leu Val Lys
235 240 245

GAG ATA ATT GAT AGA CGT AAA GAG CTT GGT TTG GAG GTA CCT AAA GTA 881
Glu Ile Ile Asp Arg Arg Lys Glu Leu Gly Leu Glu Val Pro Lys Val
250 255 260

AAG AAA CAT GTC TCG AAT GTA CAT AAG GCA CTT GAC TCG GAT GAT ATT 929
Lys Lys His Val Ser Asn Val His Lys Ala Leu Asp Ser Asp Asp Ile
265 270 275

GAG TTA GTC AAG TTG CTT TTG AAA GAG GAT CAC ACC AAT CTA GAT GAT 977
Glu Leu Val Lys Leu Leu Leu Lys Glu Asp His Thr Asn Leu Asp Asp
280 285 290 295

GCG TGT GCT CTT CAT TTC GCT GTT GCA TAT TGC AAT GTG AAG ACC GCA 1025
Ala Cys Ala Leu His Phe Ala Val Ala Tyr Cys Asn Val Lys Thr Ala
300 305 310

ACA GAT CTT TTA AAA CTT GAT CTT GCC GAT GTC AAC CAT AGG AAT CCG 1073
Thr Asp Leu Leu Lys Leu Asp Leu Ala Asp Val Asn His Arg Asn Pro
315 320 325

AGG GGA TAT ACG GTG CTT CAT GTT GCT GCG ATG CGG AAG GAG CCA CAA 1121
Arg Gly Tyr Thr Val Leu His Val Ala Ala Met Arg Lys Glu Pro Gln
330 335 340

TTG ATA CTA TCT CTA TTG GAA AAA GGT GCA AGT GCA TCA GAA GCA ACT 1169
Leu Ile Leu Ser Leu Leu Glu Lys Gly Ala Ser Ala Ser Glu Ala Thr
345 350 355

TTG GAA GGT AGA ACC GCA CTC ATG ATC GCA AAA CAA GCC ACT ATG GCG 1217
Leu Glu Gly Arg Thr Ala Leu Met Ile Ala Lys Gln Ala Thr Met Ala
360 365 370 375

GTT GAA TGT AAT AAT ATC CCG GAG CAA TGC AAG CAT TCT CTC AAA GGC 1265
Val Glu Cys Asn Asn Ile Pro Glu Gln Cys Lys His Ser Leu Lys Gly
380 385 390

CGA CTA TGT GTA GAA ATA CTA GAG CAA GAA GAC AAA CGA GAA CAA ATT 1313
Arg Leu Cys Val Glu Ile Leu Glu Gln Glu Asp Lys Arg Glu Gln Ile
395 400 405

CCT AGA GAT GTT CCT CCC TCT TTT GCA GTG GCG GCC GAT GAA TTG AAG 1361
Pro Arg Asp Val Pro Pro Ser Phe Ala Val Ala Ala Asp Glu Leu Lys
410 415 420

ATG ACG CTG CTC GAT CTT GAA AAT AGA GTT GCA CTT GCT CAA CGT CTT 1409
Met Thr Leu Leu Asp Leu Glu Asn Arg Val Ala Leu Ala Gln Arg Leu
425 430 435

TTT CCA ACG GAA GCA CAA GCT GCA ATG GAG ATC GCC GAA ATG AAG GGA 1457
Phe Pro Thr Glu Ala Gln Ala Ala Met Glu Ile Ala Glu Met Lys Gly
440 445 450 455

ACA TGT GAG TTC ATA GTG ACT AGC CTC GAG CCT GAC CGT CTC ACT GGT 1505
Thr Cys Glu Phe Ile Val Thr Ser Leu Glu Pro Asp Arg Leu Thr Gly
460 465 470

ACG AAG AGA ACA TCA CCG GGT GTA AAG ATA GCA CCT TTC AGA ATC CTA 1553
Thr Lys Arg Thr Ser Pro Gly Val Lys Ile Ala Pro Phe Arg Ile Leu
475 480 485

GAA GAG CAT CAA AGT AGA CTA AAA GCG CTT TCT AAA ACC GTG GAA CTC 1601
Glu Glu His Gln Ser Arg Leu Lys Ala Leu Ser Lys Thr Val Glu Leu
490 495 500

GGG AAA CGA TTC TTC CCG CGC TGT TCG GCA GTG CTC GAC CAG ATT ATG 1649
Gly Lys Arg Phe Phe Pro Arg Cys Ser Ala Val Leu Asp Gln Ile Met
505 510 515

AAC TGT GAG GAC TTG ACT CAA CTG GCT TGC GGA GAA GAC GAC ACT GCT 1697
Asn Cys Glu Asp Leu Thr Gln Leu Ala Cys Gly Glu Asp Asp Thr Ala
520 525 530 535

GAG AAA CGA CTA CAA AAG AAG CAA AGG TAC ATG GAA ATA CAA GAG ACA 1745
Glu Lys Arg Leu Gln Lys Lys Gln Arg Tyr Met Glu Ile Gln Glu Thr
540 545 550

CTA AAG AAG GCC TTT AGT GAG GAC AAT TTG GAA TTA GGA AAT TCG TCC 1793
Leu Lys Lys Ala Phe Ser Glu Asp Asn Leu Glu Leu Gly Asn Ser Ser
555 560 565

CTG ACA GAT TCG ACT TCT TCC ACA TCG AAA TCA ACC GGT GGA AAG AGG 1841
Leu Thr Asp Ser Thr Ser Ser Thr Ser Lys Ser Thr Gly Gly Lys Arg
570 575 580

TCT AAC CGT AAA CTC TCT CAT CGT CGT CGG TGAGACTCTT GCCTCTTAGT GTA 1894
Ser Asn Arg Lys Leu Ser His Arg Arg Arg
585 590

ATTTTTGCTG TACCATATAA TTCTGTTTTC ATGATGACTG TAACTGTTTA TGTCTATCGT 1954

TGGCGTCATA TAGTTTCGCT CTTCGTTTTG CATCCTGTGT ATTATTGCTG CAGGTGTGCT 2014

TCAAACAAAT GTTGTAACAA TTTGAACCAA TGGTATACAG ATTTGTAATA TATATTTATG 2074

TACATCAACA ATAAAAAAAA AAAAAAAAAA 2104
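
The coding-sequence feature of SEQ ID NO: 2 (location 93 to 1871, i.e. 1779 bases or 593 codons) can be related to the 593-residue protein of SEQ ID NO: 3 by translating it with the standard genetic code. The following is a minimal Python sketch of that translation. The names `CODON_TABLE` and `translate` are illustrative, and the sample input is only the stretch of SEQ ID NO: 2 from position 61 to position 113, not the full cDNA.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a coding sequence to one-letter amino acids.

    Translation stops at the first stop codon; a trailing partial codon
    is ignored.
    """
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Positions 61..113 of SEQ ID NO: 2 (second row of the listing).
fragment_start = 61
fragment = "ATTTGTGAATTTCAATTCATCGGAACCTGTTGATGGACACCACCATTGATGGA"

cds_start = 93  # 1-based start of the Coding Sequence feature
peptide = translate(fragment[cds_start - fragment_start:])
print(peptide)  # one-letter form of Met Asp Thr Thr Ile Asp Gly

# The feature spans 1871 - 93 + 1 bases, i.e. a whole number of codons.
codons = (1871 - 93 + 1) // 3
print(codons)
```

The offset arithmetic converts the 1-based feature coordinate into a 0-based index within the fragment; the same call applied to positions 93 to 1871 of the full sequence would be expected to give the 593 residues listed as SEQ ID NO: 3.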

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 593 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Met Asp Thr Thr Ile Asp Gly Phe Ala Asp Ser Tyr Glu Ile Ser Ser
1 5 10 15

Thr Ser Phe Val Ala Thr Asp Asn Thr Asp Ser Ser Ile Val Tyr Leu
20 25 30

Ala Ala Glu Gln Val Leu Thr Gly Pro Asp Val Ser Ala Leu Gln Leu
35 40 45

Leu Ser Asn Ser Phe Glu Ser Val Phe Asp Ser Pro Asp Asp Phe Tyr
50 55 60

Ser Asp Ala Lys Leu Val Leu Ser Asp Gly Arg Glu Val Ser Phe His
65 70 75 80

Arg Cys Val Leu Ser Ala Arg Ser Ser Phe Phe Lys Ser Ala Leu Ala
85 90 95

Ala Ala Lys Lys Glu Lys Asp Ser Asn Asn Thr Ala Ala Val Lys Leu
100 105 110

Glu Leu Lys Glu Ile Ala Lys Asp Tyr Glu Val Gly Phe Asp Ser Val
115 120 125

Val Thr Val Leu Ala Tyr Val Tyr Ser Ser Arg Val Arg Pro Pro Pro
130 135 140

Lys Gly Val Ser Glu Cys Ala Asp Glu Asn Cys Cys His Val Ala Cys
145 150 155 160

Arg Pro Ala Val Asp Phe Met Leu Glu Val Leu Tyr Leu Ala Phe Ile
165 170 175

Phe Lys Ile Pro Glu Leu Ile Thr Leu Tyr Gln Arg His Leu Leu Asp
180 185 190

Val Val Asp Lys Val Val Ile Glu Asp Thr Leu Val Ile Leu Lys Leu
195 200 205






Ala Asn Ile Cys Gly Lys Ala Cys Met Lys Leu Leu Asp Arg Cys Lys
210 215 220

Glu Ile Ile Val Lys Ser Asn Val Asp Met Val Ser Leu Glu Lys Ser
225 230 235 240

Leu Pro Glu Glu Leu Val Lys Glu Ile Ile Asp Arg Arg Lys Glu Leu
245 250 255

Gly Leu Glu Val Pro Lys Val Lys Lys His Val Ser Asn Val His Lys
260 265 270

Ala Leu Asp Ser Asp Asp Ile Glu Leu Val Lys Leu Leu Leu Lys Glu
275 280 285

Asp His Thr Asn Leu Asp Asp Ala Cys Ala Leu His Phe Ala Val Ala
290 295 300

Tyr Cys Asn Val Lys Thr Ala Thr Asp Leu Leu Lys Leu Asp Leu Ala
305 310 315 320

Asp Val Asn His Arg Asn Pro Arg Gly Tyr Thr Val Leu His Val Ala
325 330 335

Ala Met Arg Lys Glu Pro Gln Leu Ile Leu Ser Leu Leu Glu Lys Gly
340 345 350

Ala Ser Ala Ser Glu Ala Thr Leu Glu Gly Arg Thr Ala Leu Met Ile
355 360 365

Ala Lys Gln Ala Thr Met Ala Val Glu Cys Asn Asn Ile Pro Glu Gln
370 375 380

Cys Lys His Ser Leu Lys Gly Arg Leu Cys Val Glu Ile Leu Glu Gln
385 390 395 400

Glu Asp Lys Arg Glu Gln Ile Pro Arg Asp Val Pro Pro Ser Phe Ala
405 410 415

Val Ala Ala Asp Glu Leu Lys Met Thr Leu Leu Asp Leu Glu Asn Arg
420 425 430

Val Ala Leu Ala Gln Arg Leu Phe Pro Thr Glu Ala Gln Ala Ala Met
435 440 445

Glu Ile Ala Glu Met Lys Gly Thr Cys Glu Phe Ile Val Thr Ser Leu
450 455 460

Glu Pro Asp Arg Leu Thr Gly Thr Lys Arg Thr Ser Pro Gly Val Lys
465 470 475 480

Ile Ala Pro Phe Arg Ile Leu Glu Glu His Gln Ser Arg Leu Lys Ala
485 490 495

Leu Ser Lys Thr Val Glu Leu Gly Lys Arg Phe Phe Pro Arg Cys Ser
500 505 510

Ala Val Leu Asp Gln Ile Met Asn Cys Glu Asp Leu Thr Gln Leu Ala
515 520 525

Cys Gly Glu Asp Asp Thr Ala Glu Lys Arg Leu Gln Lys Lys Gln Arg
530 535 540

Tyr Met Glu Ile Gln Glu Thr Leu Lys Lys Ala Phe Ser Glu Asp Asn
545 550 555 560

Leu Glu Leu Gly Asn Ser Ser Leu Thr Asp Ser Thr Ser Ser Thr Ser
565 570 575

Lys Ser Thr Gly Gly Lys Arg Ser Asn Arg Lys Leu Ser His Arg Arg
580 585 590

Arg



(2) INFORMATION FOR SEQ ID NO : 4 : 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 49 amino acids



(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Asn His Arg Asn Pro Arg Gly Tyr Thr Val Leu His Val Ala Ala Met
1 5 10 15

Arg Lys Glu Pro Gln Leu Ile Leu Ser Leu Leu Glu Lys Gly Ala Ser
20 25 30

Ala Ser Glu Ala Thr Leu Glu Gly Arg Thr Ala Leu Met Ile Ala Lys
35 40 45

Gln



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 49 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Asn Ala Lys Thr Lys Asn Gly Tyr Thr Ala Leu His Gln Ala Ala Gln
1 5 10 15

Gln Gly His Thr His Ile Ile Asn Val Leu Leu Gln Asn Asn Ala Ser
20 25 30

Pro Asn Glu Leu Thr Val Asn Gly Asn Thr Ala Leu Ala Ile Ala Arg
35 40 45

Arg

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 28 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Lys Val Lys Lys His Val Ser Asn Val His Lys Ala Leu Asp Ser Asp
1 5 10 15

Asp Ile Glu Leu Val Lys Leu Leu Leu Lys Glu Asp
20 25

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 28 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Lys Thr Lys Asn Gly Leu Ser Pro Leu His Met Ala Thr Gln Gly Asp
1 5 10 15

His Leu Asn Cys Val Gln Leu Leu Leu Ser Arg Asn



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Lys His Val Ser Asn Val His Lys Ala Leu Asp Ser Asp Asp Ile Glu
1 5 10 15

Leu Val Lys Leu Leu Leu Lys Glu Asp His Thr Asn Leu Asp Asp Ala 
20 25 30 

Cys 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 



Asp Asp Ala Cys Ala Leu His Phe Ala Val Ala Tyr Cys Asn Val Lys 

1 5 10 15

Thr Ala Thr Asp Leu Leu Lys Leu Asp Leu Ala Asp Val Asn His Arg 
20 25 30 

Asn 



(2) INFORMATION FOR SEQ ID NO: 10: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 



(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Arg Gly Tyr Thr Val Leu His Val Ala Ala Met Arg Lys Glu Pro Gln
1 5 10 15

Leu Ile Leu Ser Leu Leu Glu Lys Gly Ala Ser Ala Ser Glu Ala Thr
20 25 30 

Leu 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Glu Gly Arg Thr Ala Leu Met Ile Ala Lys Gln Ala Thr Met Ala Val
1 5 10 15

Glu Cys Asn Asn Ile Pro Glu Gln Cys Lys His Ser Leu Lys Gly Arg
20 25 30 

Leu 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



Gly Thr Pro Leu His Leu Ala Ala Arg Gly His Val Glu Val Val Lys
1 5 10 15

Leu Leu Leu Asp Gly Ala Asp Val Asn Ala Thr Lys Ala Ile Ser Gln
20 25 30

Asn Asn Leu Asp Ile Ala Glu Val Lys Asn Pro Asp Asp Val Lys Thr
35 40 45

Met Arg Gln Ser Ile Asn Glu
50 55



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2172 base pairs 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

GTGACTTTCT AACTATGGCT GAAATTGCAG AACGAAAAAG ACTTTCCATT TTTCACTTGA 60

ATGAAACCCA AAATGGAAAT CTATCTCTCT TCTTCTTCTC TTTTACTACC TCCATTTCCA 120

TGGCTTTCCC TCCTCTACCT TCCCTAGCTC TTTTCAATTT CTAGAATATT CTTTTCTTAG 180

TCTGTAATTA TCTATAGCTC AATTTCTAAG ACAGAACTTA TGTAAGGCGG CTTTCTGTAA 240

TGGATAATAG TAGGACTGCG TTTTCTGATT CGAATGACAT CAGCGGAAGC AGTAGTATAT 300

GCTGCATCGG CGGCGGCATG ACTGAATTTT TCTCGCCGGA GACTTCGCCG GCGGAGATCA 360

CTTCACTGAA ACGCCTATCG GAAACACTGG AATCTATCTT CGATGCGTCT TTGCCGGAGT 420

TTGACTACTT CGCCGACGCT AAGCTTGTGG TTTCCGGCCC GTGTAAGGAA ATTCCGGTGC 480

ACCGGTGCAT TTTGTCGGCG AGGAGTCCGT TCTTTAAGAA TTTGTTCTGC GGTAAAAAGG 540

AGAAGAATAG TAGTAAGGTG GAATTGAAGG AGGTGATGAA AGAGCATGAG GTGAGCTATG 600

ATGCTGTAAT GAGTGTATTG GCTTATTTGT ATAGTGGTAA AGTTAGGCCT TCACCTAAAG 660

ATGTGTGTGT TTGTGTGGAC AATGACTGCT CTCATGTGGC TTGTAGGCCA GCTGTGGCAT 720

TCCTGGTTGA GGTTTTGTAC ACATCATTTA CCTTTCAGAT CTCTGAATTG GTTGACAAGT 780

TTCAGAGACA CCTACTGGAT ATTCTTGACA AAACTGCAGC AGACGATGTA ATGATGGTTT 840

TATCTGTTGC AAACATTTGT GGTAAAGCAT GCGAGAGATT GCTTTCAAGC TGCATTGAGA 900

TTATTGTCAA GTCTAATGTT GATATCATAA CCCTTGATAA AGCCTTGCCT CATGACATTG 960

TAAAACAAAT TACTGATTCA CGAGCGGAAC TTGGTCTACA AGGGCCTGAA AGCAACGGTT 1020

TTCCTGATAA ACATGTTAAG AGGATACATA GGGCATTGGA TTCTGATGAT GTTGAATTAC 1080

TACAAATGTT GCTAAGAGAG GGGCATACTA CCCTAGATGA TGCATATGCT CTCCATTATG 1140

CTGTAGCGTA TTGCGATGCA AAGACTACAG CAGAACTTCT AGATCTTGCA CTTGCTGATA 1200

TTAATCATCA AAATTCAAGG GGATACACGG TGCTGCATGT TGCAGCCATG AGGAAAGAGC 1260

CTAAAATTGT AGTGTCCCTT TTAACCAAAG GAGCTAGACC TTCTGATCTG ACATCCGATG 1320

GAAGAAAAGC ACTTCAAATC GCCAAGAGGC TCACTAGGCT TGTGGATTTC AGTAAGTCTC 1380

CGGAGGAAGG AAAATCTGCT TCGAATGATC GGTTATGCAT TGAGATTCTG GAGCAAGCAG 1440

AAAGAAGAGA CCCTCTGCTA GGAGAAGCTT CTGTATCTCT TGCTATGGCA GGCGATGATT 1500

TGCGTATGAA GCTGTTATAC CTTGAAAATA GAGTTGGCCT GGCTAAACTC CTTTTTCCAA 1560

TGGAAGCTAA AGTTGCAATG GACATTGCTC AAGTTGATGG CACTTCTGAG TTCCCACTGG 1620

CTAGCATCGG CAAAAAGATG GCTAATGCAC AGAGGACAAC AGTAGATTTG AACGAGGCTC 1680

CTTTCAAGAT AAAAGAGGAG CACTTGAATC GGCTTAGAGC ACTCTCTAGA ACTGTAGAAC 1740

TTGGAAAACG CTTCTTTCCA CGTTGTTCAG AAGTTCTAAA TAAGATCATG GATGCTGATG 1800

ACTTGTCTGA GATAGCTTAC ATGGGGAATG ATACGGCAGA AGAGCGTCAA CTGAAGAAGC 1860

AAAGGTACAT GGAACTTCAA GAAATTCTGA CTAAAGCATT CACTGAGGAT AAAGAAGAAT 1920

ATGATAAGAC TAACAACATC TCCTCATCTT GTTCCTCTAC ATCTAAGGGA GTAGATAAGC 1980

CCAATAAGCT CCCTTTTAGG AAATAGGTAA TTGTATTAGG ATATATGAGG AAGAAGAGGA 2040

TTTTCTTGTA ACATAGCACT CTTTCCTTTC ATCATTTGAT ATGTCAACAT ACATACAACA 2100

GCTGTACCAT AAACTTGTAT TGTTGCACTT ACAACTTTGA AGAACAGAAT TTATTTGAAA 2160

AAAAAAAAAA AA 2172

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 588 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

Met Asp Asn Ser Arg Thr Ala Phe Ser Asp Ser Asn Asp Ile Ser Gly
1 5 10 15

Ser Ser Ser Ile Cys Cys Ile Gly Gly Gly Met Thr Glu Phe Phe Ser
20 25 30

Pro Glu Thr Ser Pro Ala Glu Ile Thr Ser Leu Lys Arg Leu Ser Glu
35 40 45

Thr Leu Glu Ser Ile Phe Asp Ala Ser Leu Pro Glu Phe Asp Tyr Phe
50 55 60

Ala Asp Ala Lys Leu Val Val Ser Gly Pro Cys Lys Glu Ile Pro Val
65 70 75 80

His Arg Cys Ile Leu Ser Ala Arg Ser Pro Phe Phe Lys Asn Leu Phe
85 90 95

Cys Gly Lys Lys Glu Lys Asn Ser Ser Lys Val Glu Leu Lys Glu Val
100 105 110

Met Lys Glu His Glu Val Ser Tyr Asp Ala Val Met Ser Val Leu Ala
115 120 125

Tyr Leu Tyr Ser Gly Lys Val Arg Pro Ser Pro Lys Asp Val Cys Val
130 135 140

Cys Val Asp Asn Asp Cys Ser His Val Ala Cys Arg Pro Ala Val Ala
145 150 155 160

Phe Leu Val Glu Val Leu Tyr Thr Ser Phe Thr Phe Gln Ile Ser Glu
165 170 175

Leu Val Asp Lys Phe Gln Arg His Leu Leu Asp Ile Leu Asp Lys Thr
180 185 190

Ala Ala Asp Asp Val Met Met Val Leu Ser Val Ala Asn Ile Cys Gly
195 200 205

Lys Ala Cys Glu Arg Leu Leu Ser Ser Cys Ile Glu Ile Ile Val Lys
210 215 220

Ser Asn Val Asp Ile Ile Thr Leu Asp Lys Ala Leu Pro His Asp Ile
225 230 235 240

Val Lys Gln Ile Thr Asp Ser Arg Ala Glu Leu Gly Leu Gln Gly Pro
245 250 255

Glu Ser Asn Gly Phe Pro Asp Lys His Val Lys Arg Ile His Arg Ala
260 265 270

Leu Asp Ser Asp Asp Val Glu Leu Leu Gln Met Leu Leu Arg Glu Gly
275 280 285

His Thr Thr Leu Asp Asp Ala Tyr Ala Leu His Tyr Ala Val Ala Tyr
290 295 300

Cys Asp Ala Lys Thr Thr Ala Glu Leu Leu Asp Leu Ala Leu Ala Asp
305 310 315 320

Ile Asn His Gln Asn Ser Arg Gly Tyr Thr Val Leu His Val Ala Ala
325 330 335

Met Arg Lys Glu Pro Lys Ile Val Val Ser Leu Leu Thr Lys Gly Ala
340 345 350

Arg Pro Ser Asp Leu Thr Ser Asp Gly Arg Lys Ala Leu Gln Ile Ala
355 360 365

Lys Arg Leu Thr Arg Leu Val Asp Phe Ser Lys Ser Pro Glu Glu Gly
370 375 380

Lys Ser Ala Ser Asn Asp Arg Leu Cys Ile Glu Ile Leu Glu Gln Ala
385 390 395 400

Glu Arg Arg Asp Pro Leu Leu Gly Glu Ala Ser Val Ser Leu Ala Met
405 410 415

Ala Gly Asp Asp Leu Arg Met Lys Leu Leu Tyr Leu Glu Asn Arg Val
420 425 430

Gly Leu Ala Lys Leu Leu Phe Pro Met Glu Ala Lys Val Ala Met Asp
435 440 445

Ile Ala Gln Val Asp Gly Thr Ser Glu Phe Pro Leu Ala Ser Ile Gly
450 455 460

Lys Lys Met Ala Asn Ala Gln Arg Thr Thr Val Asp Leu Asn Glu Ala
465 470 475 480

Pro Phe Lys Ile Lys Glu Glu His Leu Asn Arg Leu Arg Ala Leu Ser
485 490 495

Arg Thr Val Glu Leu Gly Lys Arg Phe Phe Pro Arg Cys Ser Glu Val
500 505 510

Leu Asn Lys Ile Met Asp Ala Asp Asp Leu Ser Glu Ile Ala Tyr Met
515 520 525

Gly Asn Asp Thr Ala Glu Glu Arg Gln Leu Lys Lys Gln Arg Tyr Met
530 535 540

Glu Leu Gln Glu Ile Leu Thr Lys Ala Phe Thr Glu Asp Lys Glu Glu
545 550 555 560

Tyr Asp Lys Thr Asn Asn Ile Ser Ser Ser Cys Ser Ser Thr Ser Lys
565 570 575

Gly Val Asp Lys Pro Asn Lys Leu Pro Phe Arg Lys
580 585

(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

GTGACAGACT TGCTCCTACT G 21

(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:
CAGTGTGTAT CAAAGCACCA 20

(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:
TTCTCCAGAC CACATGATTA T 21

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:
TGAAGCTAAT ATGCACAGGA G 21

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:
GTAGGTGCTC TTGTTCTTCC C 21

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

CACATAATTC CCACGAGGAT C 21

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 amino acids 






(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

Met Lys Gly Thr Cys Glu Phe Ile Val Thr Ser Leu Glu Pro Asp Arg

1 5 10 15

Leu 

(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 14 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

Arg Arg Lys Glu Leu Gly Leu Glu Val Pro Lys Val Lys Lys
1 5 10

(2) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 14 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:

Lys Lys Gln Arg Tyr Met Glu Ile Gln Glu Thr Leu Lys Lys
1 5 10

(2) INFORMATION FOR SEQ ID NO: 24:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:
AARGARGAYC AYACNAA 17

(2) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:
TAYGTYAAYG TNAARAC 17

(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:
GCCATNGTNG CYTGYTT 17

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:
AARGTNAARA ARCAYGT 17

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:

RAAYTCRCAN GTNCCYTTCA T 21
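
SEQ ID NOs: 24 to 28 are written with IUPAC ambiguity codes (R = A or G, Y = C or T, N = any base), so each listed entry stands for a pool of concrete oligonucleotides. The following is a minimal Python sketch, under that standard reading of the codes, of how the pool size can be counted, how the pool can be enumerated, and how a degenerate sequence can be reverse-complemented. The function names `degeneracy`, `expand`, and `reverse_complement` are illustrative and do not appear in the specification.

```python
from itertools import product
from math import prod

# IUPAC nucleotide ambiguity codes and their complements.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}
COMPLEMENT = {
    "A": "T", "C": "G", "G": "C", "T": "A",
    "R": "Y", "Y": "R", "S": "S", "W": "W", "K": "M", "M": "K",
    "B": "V", "V": "B", "D": "H", "H": "D", "N": "N",
}


def degeneracy(primer):
    """Number of distinct concrete sequences the degenerate primer covers."""
    return prod(len(IUPAC[base]) for base in primer)


def expand(primer):
    """List every concrete sequence covered by the degenerate primer."""
    return ["".join(p) for p in product(*(IUPAC[base] for base in primer))]


def reverse_complement(primer):
    """Reverse complement, keeping ambiguity codes consistent."""
    return "".join(COMPLEMENT[base] for base in reversed(primer))


seq_24 = "AARGARGAYCAYACNAA"        # SEQ ID NO: 24
seq_28 = "RAAYTCRCANGTNCCYTTCAT"    # SEQ ID NO: 28

print(degeneracy(seq_24))
print(reverse_complement(seq_28))
```

Read in codons, the reverse complement of SEQ ID NO: 28 is ATG AAR GGN ACN TGY GAR TTY, which is consistent with the first seven residues of SEQ ID NO: 21 (Met Lys Gly Thr Cys Glu Phe); this suggests, without the listing stating it, that SEQ ID NO: 28 is an antisense-orientation oligonucleotide.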



